Phonetic-context mapping in language identification

نویسندگان

  • Jirí Navrátil
  • Werner Zühlke
چکیده

This paper deals with the problem of exploiting information from a wide phonetic context for the purpose of language identi cation. Two approaches to language modeling are presented here: 1) modi ed bigrams with a context-mapping matrix and 2) language models based on binary decision trees. Both models were incorporated in a phonotactic language identi er with a double-bigram decoding architecture and were shown to consistently improve the performance of standard bigrams. Measured on the NIST'95 evaluation set, the described system outperforms the state-of-the-art phonotactic components and is, at the same time, computationally less expensive.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

مقایسه روش های طیفی برای شناسایی زبان گفتاری

Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...

متن کامل

The Effect of English Vowel-Recognition Training on Beginner and Advanced Iranian ESL Learners

This study was an attempt to investigate the effect of vowel-recognition training on beginner and advanced Iranian ESL learners. A total of 36 adult Iranian ESL learners (18 advanced and 18 beginners) who were students of various majors at Memorial University (MUN) were recruited for the study. Advanced participants had the experience of living in Canada for at least three years while beginners...

متن کامل

Context-sensitive probabilistic phone mapping model for cross-lingual speech recognition

This paper presents a probabilistic phone mapping model (PPM) that makes possible automatic speech recognition using a foreign phonetic system. We formulate the training of the phone mapping model in the framework of maximum likelihood estimation. The model can be learned automatically from the reference phonetic transcript and the phonetic transcript resulting from a foreign phonetic recognise...

متن کامل

An efficient phonotactic-acoustic system for language identification

This paper presents a combined two-component system for language identiication based on phonotactic and acoustic features. The phonotactic part consisting of a multilingual phone-recognizer with a double bigram-decoding architecture and a phonetic-context mapping is supported by a second part with pronunciation modeling of the recognized phone-sequence using Gaussian density models. Both parts ...

متن کامل

Spoken word identification by native and nonnative speakers of English: effects of training, modality, context and phonetic environment

Several experiments explored the contribution of visual information (lip movements) to spoken word identification by Japanese and Korean learners of English as a second language (ESL) and native speakers (NSs) of English, and its interaction with sentence context, phonetic environment and, for ESL learners, perceptual training (involving /m,l,p,f,},s/) using

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997